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ABSTRACT 



Factors affecting a lower-bound estimate of internal 



consistency reliability, Cronbach's coefficient alpha, are explored. 
Theoretically, coefficient alpha is an estimate of the correlation 
between two tests drawn at random from a pool of items like the items 
in the test under consideration. As a practical matter, coefficient 
alpha can be an index of internal consistency (i.e., an index of the 
degree to which item response scores correlate with total test 
scores). Cronbach's procedure is a method of calculating coefficient 
alpha. Three factors affecting coefficient alpha — total test 
variance, sum of item variances, and homogeneity of item 
difficulty — were examined using a mini Monte Carlo model. Small data 
sets generated by a computer program were used to make the discussion 
concrete. By considering bivariate correlation, multiple R 
correlation, beta-weight, and structure coefficients, total test 
variance was shown to account for the most variance of coefficient 
alpha, followed by the standard deviation of item difficulties, and 
the sum of item variances. Seven tables and three figures illustrate 
the discussion. (SLD) 
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ABSTRACT 

The paper explores the factors that Influence coefficient alpha as 
a lower bound estimate of score reliability. The purpose of the 
paper Is to couiunlcate the relative Influence on alpha of various 
factors. Small data sets generated by a computer program are 
employed to make the discussion concrete and readily accessible. 
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Reliability Is critical In datacting affects In substantive 
research. For exaaple, if a dependent variable Is neasured such 
that It is perfectly unreliable, the effect size in the study will 
unavoidably be zero, and the results will not be statistically 
significant at any sanple size, including an infinite one. As 
Locke, Spirduso and Silverman (1987, p. 28) note, "the correlation 
between scores from two tests cannot exceed the square root of the 
product for reliability in each test." Thus, if a researcher is 
correlating scores having a reliability of .9 with scores having a 
reliability of .6, the correlation cannot exceed .73. 
Prospectively, researchers must select measures that will allow 
detection of effects at the level desired; retrospectively, 
researchers must take reliability into account when interpreting 
findings. 

Meier and Davis (1990), however, recently reported that 
published studies often do not adequately report reliability 
estimates so that results can be carefully evaluated. This has 
historically been the case with* respect to published research. For 
example, Willson (1980) found that almost half the published 
studies he examined did not report reliability information. The 
same pattern seems to occur in dissertation research (LaGaccia, 
1991) . 

The present paper explores the factors affecting a lower-bound 
estimate of internal consistency reliability, Cronbach's 
coefficient alpha. One aspect of this treatment involved use of a 
BASIC computer program (Thompson, 1990) that implements a small 
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seal* Montft Carlo study for heuristic purposes. 
Wh^t is eo affialent aloha? 

Ifhen an estlinate of the reliability of scores on a test is 
needed and the r>arall«l for is and test-retest approaches are 
inpractical, researchers typically rely on internal consistency 
coefficients such as coefficient alpha (Feldt, Woodruff, & Salih, 
1987) . Theoretically, coefficient alpha is an estimate of the 
correlation expected between two tests dra%m at random from a pool 
of items like the items in the test under consideration. 
Practically, coefficient alpha can be used as an index of internal 
consistency, i.e., an index of the degree to which item response 
scores (e.g., "O" or "1" in achievement testing) correlate with 
total test scores. Crocker and Algina (1986, p. 142) describe 
coefficient alpha as 

...not a direct estimate of the [theoretical] 
reliability coefficient but rather an estimate of 
the lower bound of that coef ficient .Alpha is the 
mean of ail possible split-half coefficients that 
are calculated. 

Alpha, therefore, can be interpreted as the lower bound estimate of 
the proportion of variance in the test scores explained by common 
factors underlying item performance. 

Alpha is superior to the use of the split-half estimate of 
internal consistency, because for most tests of any length there 
are usually many splits, and the estimates associated with 
different splits for the same data may well yield contradictory 
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results. As Brownell (1933) pointed out long ago, for a test with 

k iteas, thers are (.5(ki))/[(.5(k) ) i splits of the iteas. For 

•xaaple, for e test with six items, the number of splits (1«2,3 vs 

4,5,6, 1,2,4 vs 3,5,6, etc.) equals: 

■ s f6 X 5 « 4 X 3 X 2 X 1^ - .5 X 72Q -1^-10 
(3x2xlp 6^ 36 

For a test with k-2 items there is one unique split. For a 

test tfith k»4 items there are three splits. For a test with six 

items, there are 10 splits, as indicated previously. For a test 

with 10 items there are 126 splits. Clearly the number of splits 

(and the likelihood of contradictory results acroas splits) 

escalates dramatically as items are added, even for relatively 

short tests. 

Coefficient alpha has the saae value no natter which method of 
compute cion is used. With this advantage, however, comes some 
cautions when interpreting alpha. Crocker and Algina (1986) warn 
that a relatively high value of coefficient alpha is not related to 
stability of the test scores over time, or to their equivalence to 
scores on one specific alternate form of the test, or to the 
unidemensionality of test items (i.e., performance on these items 
cannot necessarily be explained in terms of a single underlying 
factor) . 

This last limitation actually represents a common 
methodological error made by many researchers. A large alpha does 
not indicate that the test is unidimensional, and cannot be factor 
analyzed, notwithstanding the propensity of some researchers to 
foirmulate exactly this interpretation. 



now Im aomt tLeimnt alpi»« oKmlafd? 

' Three procedures used to calculate coefficient alpha 
determined from a single adninistration of a test are (a) Hoyt's 
analysis of variancs, (b) Cronbach's alpha, and (c) Kuder- 
Richardson formula #20. All three methods yield identical results. 
Hoyt's method (Hoyt, 1941) is based on the analysis of variance, 
treating persons and items as sources of variation. Cronbach's 
alpha (Cronbach, 1951) uses item variance, total test variance, and 
the length of the test to compute an estimate of the incernal 
consistency of items which are either dichotomously scored (i.e., 
scored either "0" or "1"), or which have a wide range of scoring 
weights. Cronbach's alpha is computed by the formula: 

j^I 4^ (V > 0, K > 1) 

where X is the number of items on the test, Za^^ is the sum of item 
variances, and a^^ is the total test variance. 

The Kuder-Richardson formulat #20 (Kuder & Richardson, 1937) 
is equivalent to Cronbach's alpha; however, KR 20 can only be used 
with dichotomously scored items. The KR 20 formula is identical to 
Cronbach's except the sum of item variances can be computed by 
using a simplified formula, a^^ = pq, where p is the proportion of 
subjects correctly answering a given item (sometimes called item 
difficulty) and q (the proportion of subjects answering the item 
wrong) is 1 - p: 

o - Jl. * (1 - SEfll 
k-1 

For a complete algebraic proof of the formula, a^^ = pq, for 
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dictomously scored Itens, see Crocker and Alglna (1986, pp. 90*92). 

For dlchotonously scored Items, item difficulty (p) can only 
range betveen 0 and 1 Inclusive, since for a given Item j no fever 
than a proportion of 0 (or 0%) of the subjects can get the Item 
right and no more than a proportion of 1 (or 100 1) of the subjects 
can get the Item right. Though It Is perhaps counter -Intuitive, 
larger p-values Indicate easier Items, and smaller p-vr\ues 
describe more difficult Items. KR 20 will be used to calculate 
coefficient alpha in this study because all of the test items used 
are dichotomously scored. 
What factors effect coefficient alpha? 

Both the characteristics of the person sample selected and the 
characteristics of the test items can affect coefficient alpha. 
For example, if a group of examinees is homogeneous with respect to 
knowledge about physics, that is they have similar competencies in 
this area, then one would expect them to score about the same on a 
test of physics concepts. In this example, the variability of 
total test scores would be small because most examinees would have 
a similar score. This would result in dividing each item variance 
by a smaller niunber, yielding a larger result that is then 
subtracted from one, and thereby yielding a smaller estimated 
reliability, as can be seen by examining the KR 20 formula. Thus, 
though it is not widely recognized by researchers, the nature (and 
even the size) of the sample of subjects can impact the estimated 
classical reliability coefficient. 

In considering the characteristics of the test that may affect 
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coefficient alpha, it is inportant to note that a test is not 
reliable or unreliable; rather, "reliability is a property of the 
scores on a test for a particular group of examinees'* (Crocker & 
Algina, 1986, p. 144, emphasis added). Depending on the types of 
iteas added, increasing the length of the test usually increases 
coefficient alpha, that is if (and only if) the items are of equal 
or better quality than the other items on the test. The more items 
that are selected from the "pool of items," the less likely that 
the item sample will be biased, or will underestimate the 
theoretical coefficient alpha. 

In the computational equation for coefficient alpha, the k/ (k-* 
1) term corrects for this bias; when the number of items on the 
test, increases, then this correction term gets smaller and 
smaller. For example, for a 2* item test, the correction multiplier 
would be (2/(2-1)) or 2; with a 10-item test, k equals 1.11; with 
a 100- item test, k equals 1.0101. There is a point when adding 
items will help less and less to increase coefficient alpha as 
regards this correction aspect of the formula. 

Three of the factors that affect coefficient alpha, total test 
variance, sum of item variances, and homogeneity of item 
difficulty, p, will be examined in the present study using a mini 
Monte Carlo model. The Monte Carlo method is a technique for 
obtaining an approximate solution to certain mathematical and 
logical problems; it characteristically involves randomly sampling 
from some specified and known universe. The process is usually 
done on a computer, as it was in this study. 
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Characteristics cf the modelled Itens sets are easily 
manipulated using this method; the method allows use of a vide 
variety of Item set types to help Increase the generallzablllty of 
the results. For the present study, /our population target mstricBS 
or models were used to manipulate the combination of (a) total test 
vari ance , (b) sum of Item variances , and (c) homogeneity/ 
heterogeneity of Item difficulty for a hypothetical 7-item, 
dlchotomously scored test with 10 examinees. The purpose of this 
Monte Carlo modeling was to Invest igate the relative effects of 
total test varian ce, item variance, and homogeneity of — i^£ffi 
difficulty with res pect to their influences on rellabllltv. 

Each population target model creates the conditions necessa :y 
to maximize, minimize or moderate each of the three factors that 
affect coefficient alpha. As part of the random sampling process, 
a probability is specified to influence the degree to which random 
samples will reflect each parameter in the population. For 
example, if the assigned probability is .2 for item 2 of person 2, 
then when a random number is generated to determine whether the 
score for this item in a given random sample will be a **0" or a 
**1,** on the average 20 percent of the time thii. item will be scored 
1 and 80 percent of the time this item will be scored a 0. Table 
1 presents the four population models used in the study. 

Insert Table 1 about here 

As can be determined by consulting Table 1, the four models 
create various patterns in the total test variance, sum of item 
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variances, and homogeneity/heterogeneity of item difficulty. Model 
1 produces «axi»u» total score variance, aaxiaun item variance, and 
hoaogeneoua itea difficulty. Model 2 generates minimun total score 
variance, ■axi»u» ite« variance, and homogeneous item difficulty. 
Nodtl 3 produces moderate total score variance, moderate sums of 
the item variances, and heterogeneous item difficulty. Model 4 
generates minimum total variance, moderate sums of items variances, 
and heterogeneous item difficulty. 
Bow oan totel test vrlanee be magimiiedl 

On a 7-iten, dichotomously scored test with 10 examinees, the 
total test score for each examinee ranges from 0 to 7. Figure 1 
uses histograms to illustrate various total test score combinations 
and their corresponding total test variance. When everyone gets a 
score of "l" for each item, each examinee's total test score is 7. 
The item difficulty, p, for each item in this example is 1 and the 
total test variance, Oj^, is 0, because there is no "spreadoutness- 
in the scores. Similarly, when everyone gets i score of 0 for each 
item, each examinee's total ecore is 0, item difficulty, p, for 
each item is 0, and once again, total test variance, ffj^, is 0, 
because there is no "spreadoutr.ess" in the scores. 



Insert Figure 1 about here 



The third histogram in Figure 1 shows that when each examinee 
earns a different score (i.e., all of the possible total test 
scores are represented), then the total test variance is 
approximately 3. The maximum total test variance possible on a 7- 
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it«i, dichtoBously scored test with 10 examinees, however, ie «uch 
higher than 3. Total teat variance is ■aximized when the sua of 
squares is naxiaized. This occurs when the total test scores are 
"spreadout" the aost, or in other words, vhOT hfllf Qf thfl gXflBingeg 

««rn th« iq tf ftfit BC"r«> fhere if 8 Q) and half earn 

th« hiaheat poaslble t gtfll T"^* • °' 

examinees get all of the items "correct" and half "miss" all of the 
items, then the total test variance is maximized as 12.25. The 
range of the total test variance on a 7-item dichotomously-scored 
test, then, is from 0 to 12.25. 

By examining the total test scores of the gxpggtgd <?r 
population model in Table 2b, one can determine that Model 1 will 
maximize the total test variance, because half of the examinees in 
the population earn total test scores of 0 and half earn total test 
scores of 7. Table 3b shows that Model 2 produces a minimal total 
test variance, because half of the examinees earn a total test 
score of 3 and the other half earn a total test score of 4. The 
minimal "spreadoutness" yeilds a total -est variance of .25 in the 
actual population from which the 10 random samples for Model 2 were 
drawn. 



Insert Tables 2 and 3 about here 



Table 4b demonstrates that Model 3 yeilds a moderate total 
test variance of 3.36, because half of the examinees in the 
population earn a total test score of 0, one-forth earn a total 
test score of 3, and one-forth earn a total test score of 4. Table 



5b show* that Mod«l 4, lik« Modal 2, BiniBizes th« total tast 
varianca. In the population fro» which the 10 data sets were 
randomly drawn for Model 4, all of the examinees earn a total test 
score of 2. With no "spreadoutness" in the scores, the total test 
variance is 0. 

Insert Tables 4 and 5 aK>out here 

How on thm mum of Ifc— vriencae be ■a^ialied? 

A method similar to the one just described to find maximum 
total test variance can be used to determine the maximum sum of 
item variances. Consider the scores on a given item j to be 
analogous to the scores on the total test. To maximize the 
variance of a single item j, one must maximize the "spreadoutness" 
of scores which is measured by using the the sum of squ-sres. From 
the previous discussion, it was shown that if half of the scores 
are at one extreme of the range and half are at the other extreme, 
then the sum of squares, and therefore, the variance, is maximized. 
Since the possible scores for aach examinee on a given item j are 
either "0" or "1", then to maximize the item's variance, half of 
the examinees would earn a 0 and half would earn a 1. 

Figure 2 shows that items with moderate difficulty (i.e., p « 
.5) maximize item variance and items that are either easier (e.g., 
p - 1) fit harder (e.g., p " 0) tend to minimize item variance. 
Item variances can range from 0 to .25; therefore, the maximum sum 
of item variances for a 7-item, lichotomously scored test is (.25 
times 7) or 1.75. 

10 
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Ins«rt Plgur* 2 about h«r« 

Tabl« 2b shows that tha axpacted or population Bodal for Nodal 
1 yailds tha Baxiauii sua of itaa variancas (1.75) bacausa half of 
tha axaainaas on auch itaa aarn a 0 and half aarn a 1. Similarly , 
Tabla 3b shows that Modal 2 aaxlnlzes tha sum of itaa variancas by 
tha saaa aethod. Tabla 4b indicates that Model 3 produces a 
■oderata sun of itaa variances (£0|^^ - .96) , because on half of tiie 
items everyone earned a 0 and on the other half, half of the 
examinees earned a 0 and half earned a 1. A similar procedure was 
used in Model 4 (Table 5b) to yield a moderate sum of item 
variances (Zo^^ * 1.0). 

What does homoaana oua d and hataroQana.aus p aaan? 

Homogeneous p, or difficulty level , means that all the items 
have the same or similar difficulty level. This is equivalent to 
having the standard deviation of p be either zero or be very small. 
Models 1 and 2 both illustrate homogeneous p, because for each 
item, p equals .5. "^his means that half of the examinees earn a 0 
on each item and half earn a 1. 

Item difficulty, p, is heterogeneous when the items vary in 

difficulty, and thus, tha standard deviation of p would be large. 

Both Models 3 and 4 illustrate heterogeneous p because half of the 

items have a p-value of .5 and the other half have a p-va'.ue of 0. 

How do total test variance, sum of l t»» variancas. ififl 

homoaanaitY/hataroaanaitY of it ain difficulty affect alpha? 

Using the mini Monte Carlo method, 10 random samples of the 

population models were generated to examine the affects on 
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coefficient alpha of aanlpulatlng the three target factors: (a) 
total teet variance, (b) sua of Itea variance, and (b) homogeneity/ 
heterogeneity of Itea difficulty. Table 6 displays the results of 
the study, listing the aean of the alpha coefficients generated by 
the 10 randoe sanples for each population nodel, along with the 
standard deviation, and BininuB and aaxlaua values for the sampled 
alphas. The table also lists the mean, standard deviation, and 
Binlaum and maximum values for the total test variances (TESTVAR) , 
the sums of the item variances (IVARSUM), and the standard 
deviation of the item difficulties (ITEMPSD) . 

Insert Table 6 about here 

Model 1 (maximum TESTVAR, maxlmvun IVARSUM, small ITEMPSD) 
tended to yeild the highest alpha coefficients. The mean of the 10 
alphas for Model 1 was .94, with a very siuall standard deviation of 
.026, Indicating that most of the alpha coefficients clustered 
close to .94. Model 2 (moderate TESTVAR, moderate IVARSUM, small 
ITEMPSD) produced the second highest alpha coefficients (a « .656, 
a ' .138). Model 2 (minimum, maximum, homogeneous) and Model 4 
(minimum, moderate, heterogeneous) yielded the lowest coefficient 
alpha levels. In fact, both of these mean alpha values were 
negative (Model 2, o - -.146, SD = .377; Model 4, a » -.232, SD » 
.509) . 

These last two results make the point that alpha can be 
negative. Indeed, the population model reported : .i Table 3b yields 
a value of -7.0. These results make the point that alpha is a 
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lower bound on a true r)%l lability estimate, since the coefficient 
does not behave as a correlation coefficient would. 

FroB these results several tentative conclusions can be drawn. 
First, aoderate to aaxiaua total test variance Is Important to 
■axlalze coefficient alpha. It appears that total test variance 
accounts for much of the variance that explains coefficient alpha. 
Second, total test variance appears to have more affect on alpha 
that the sun of Item variances. Third, the homogeneity/ 
heterogeneity of item difficulty, p, seems to have minimal effect 
on alpha. To further define these tentative conclusions, the 
relationship of total test variance, sum of item variances, and 
item difficulty must be considered. 

HOW can the relatio n«hlp of total varianee. sum of Item 

ygriances- and It— difficu lty be inve8*£iQated? 

To determine the portion of variance of alpha explained by 
each of the factors alone across the 40 sets of results (4 models 
X 10 random samples of results for each) , the bivariate 
correlations between alpha and each of the factors separately can 
be computed. If Y is alpha, X3 is total test variance, Xj is siua 
of item variances, and is the standard deviation of item 
difficulties, then for the data in the present study ry^j is .772, 
ry„2 ^8 -O^^' '^^^ ^yxi -.317. By squaring each of these 
bivariate correlation coefficients, one can determine the percent 
of varianco that each uniquely contributes to alpha. 

The factor, total test variance, explains 60% of the variance 
of alpha (r^ = .5959); the factor, sum of item variances, explains 
less than 1% of the variance of alpha (r^ = .0069); and the 

13 



stAnd&rd deviation of Itra difficulties sxplains 10% of th« 
varlaac* of alpha (r^ - .10). Pro« thl« brlaf analysis, It appears 
that total test variance has the greatest affect on alpha, followed 
by the standard deviation of lte« difficulties, and finally, the 
sum of lt<ui variances. 

A multiple regression analysis can assist In Investigating the 
collective and separate contributions of the three Independent 
variables, total test variance, sua of Iten variances, and the 
standard deviation of item difficulties, to the variation of the 
dependent variable, coefficient alpha (Howell, 1982, pp. 414-419). 
The iBultiple R correlation coefficient expresses the magnitude of 
the relation between th«< best possible combination of all 
independent variables (X^, Xj, X3) and the dependent variable (Y) 
(Kerlinger, 1979, pp. 171-172). R^, which is similar to r^, is the 
proportion of variance of Y accounted for by the regression 
combination of all the independent variables, excluding double 
counting of any area of Y jointly predicted by two or more of the 
predictors. Since in the present example R^ was .73298, then 73% 
of the variance of alpha is explained by the combination of X^ and 
X2 and X3 . 

The regression equation or prediction equation from the data 
in this study is: 

Y' » .189984 1:3 - 1.067318 Xj + .57629 X^ + 1.092742 
where Y« is predicted alpha, X3 is total test variance, Xj is sum 
of item variances, and X^ is the standard deviation of item 
difficulties. These results seem to suggest the incorrect 

14 



-1 f ) 



conclusion that th« sua of th« it«i variances is th« best predictor 
of alpha. 

But the B-weight coefficients for each of the independent 
variables cannot be compared because their units do not have the 
saae scales of aeasursBent. To alleviate this problea, the 
regression equation can be written in Z-score fom (Howell, 1982, 
pp. 419-420). By converting the B-weights to B-weights by using 
the fcraula 6 - b (SD,/SDy) , the effects of the different 
measurement scales are washed out, and the B-weight coefficients 
can be compared. The new regression equation in Z-score form is: 

Z'y - 1.014169 Z3 - .395216 Zj + .053201 + 0 

The unthoughtful researcher may wish to discuss the relative 
influence each independent variable has on alpha by considering 
only the fl-weight coefficients; however, Thompson and Borrello 
(1985) point out that especially with small to moderate sample 
sizes, 6-weight coefficients will fluctuate. This fluctuation is 
due to the back and forth distribution of shared variance among 
independent variables. In any case, these weights are influenced by 
the correlation among the predictor variables, and should not be 
the sole basis for interpreting regression results. 

When predictor variables are correlated with each other, as 
most are, then "collinearity" or "multicollinearity" exists. 
Kerlinger (1979, p. 165) notes that "independent variables [in 
educational psychology research] .. .are aiaost always correlated, 
often substantially so." In this study, the independent variables 
are correlated with each other, and thus the 6-weight coefficients 
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should not be th« only indic«s to us« when Interpreting relative 
Influence on alpha of these three factors. Pedhazur (1982, p. 246) 
reiterates this point by stating that "the presence of high 
Bulticol linearity poses serious threats to the interpretation of 
regression coefficients [fi-weights] as indices of effects." 

Thompson and Borello (1985, p. 208) note that "structure 
coefficients are not suppressed or inflated by collinearity " 
These coefficients, unlike 6-weight coefficients, can be used to 
describe the dependent variable in terms of its relationships with 
the independent variables. The structure coefficient for a 
predictor variable is equal to the bivariate correlation between 
the predictor variable and the dependent variable divided by the 
multiple R correlation coefficient: 

structure coefficient of = ^yxl/^ 
The structure coefficients for each of the independent variables in 
this study were .90172 for Xj, .09695 for Xj, and -.37027 for X^. 

Table 7 presents the rank orders of the influences on the 40 
alphas for each factor using B-weight coefficients and structure 
coefficients. Both coefficients suggest that total test variance 
is the most important influence on alpha. The structure 
coefficients indicate that the standard deviation of item 
difficulties ranked second in influence and the sun of item 
varia.ices ranked third. 

Insert Table 7 about here 

By using the three predictor variables of total test variance. 
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•UB Of itma variances, and tha standard daviation of ita» 
dlf f Icultlas, coaff iciant alpha was raasonably accurataly pradictad 
(r' - V3.3%) for ttoasa 40 sats of data. Figura 3 illustratas the 
ralatlvaly cloaa fit batwaan tha 40 obsarvad and pradictad alpha 
valuas. 

Insart Figura 3 about hara 

Using data generated by a slnl Monte Carlo sethod BASIC 
computer program, the present study examined three factors (total 
test variance, sum of Item variances, and the standard deviation of 
Item difficulties) that affect coefficient alpha. By considering 
blvarlate correlation, multiple R correlation, B-welght, and 
structure coefficients, total test variance was show to accoxint 
for the most variance of coefficient alpha, followed by the 
standard deviation of Item difficulties, and the sum of Item 
variances. 

There are two reasons to suspect that these results will 
generalize. The first reason Is empirical . The four population 
models represent a wide range of possible results that can occur in 
practice, suggesting that the ballpark of possibilities was covered 
In the mini Monte Carlo study. The second reason Is tn^orgtigfll « As 
can be surmised by examining the formula for KR 20, because total 
test variance can range widely In numerical values especially as 
test length Increases, It should be expected that total test 
variance can dominate alpha. Thus, the results and the expository 
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trMtMnt h«r« suag^st that rttSttarch«irs and practition«r« should 
ganarally focus on total scora varianca whan thay ara trying to 
Baximisa alpha. 
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Tabl* 1 

Four Models that Manipulate Factors Affsctlng Cosfflclont Alpha 



Factor Model 1 Nodsl 2 Nodal 3 Nodal 4 



Total Teat Variance MAX MIN NOD NIN 

Sub of Item Variances MAX M\X NOD NOD 

Honogenelty/ HON HON HET HET 
Heterogeneity of p 



Tabl« 3 



Probability Targat Matrix (a) , Population Nodal Matrix (b) , and One 
of th« 10 Randoaly Samplad Matricas (c) for Modal 1 

a. MAX TEST vAR (G) , MAX ITEM VAR (B) , HOMOGENEOUS p (G) 
PROBABILITY TARGET MATRIX 



1 


.1 


.1 


.1 


.1 








2 


.1 


.1 


.1 


.1 








3 


.1 


.1 


.1 


.1 








4 


.1 


.1 


.1 


.1 








5 


.1 


.1 


.1 


.1 








6 


.9 


.9 


.9 


.9 






.9 


7 


.9 


.9 


.9 


.9 


.9 




.9 


8 


.9 


.9 


.9 


.9 


.9 




.9 


9 


.9 


.9 


.9 


.9 


.9 


.9 


.9 


10 


.9 


<9 


.9 


.9 


.9 


.9 


.9 



b. MAX TEST VAR (G) , MAX ITEM VAR (B) , HOMOGENEOUS p (G) 
Kno%m Results for the Population 
n 1 2 3 4 5 5 2 TQI&L 



10000000 0 

20000000 0 

30000000 0 

40000000 0 

50000000 0 

61111111 7 

71111111 7 

81111111 7 

91111111 7 

12 1 1 1 1 1 1 1 1 

p .5 .5 .5 .5 .5 .5 .5 

vari .25 .25 .25 .25 .25 .25 .25 12.25 

ALPHA » 1.166667 TIM^IS (1 - (1.75 / 12.25)) » 1 

c. MAX TEST VAR (G) , MAX ITEM VAR (B) , HOMOGENEOUS p (G) 

n 1 2 3 4 5 6 2 IQIAL 

10000010 1 

20000000 0 

31000000 1 

40000000 0 

50001000 1 

61111110 6 

71111111 7 

81111111 7 

91111111 7 

Ifl 1 1 1 1 1 1 1 1 

p .6 .5 .5 .6 .5 .6 .4 

var. .24 .25 .25 .24 .25 .24 .24 9.809999 ALPHA » 

1.166667 TIMES (1 - (1.71 / 9.809999)) - .9633028 
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Tabltt 3 



Probability Targat Katrlx (a) , Population Modal Matrix (b) , and One 
of tha 10 Randoaly Saaplad Natrican (c) for Modal 2 

a. MIN TBST VAR (B) , MAX ITM VAR (B) , HOMOGENEOUS p (6) 
PROBABILITY TARGET MATRIX 



1 


.6 


.1 


.6 


.1 


.6 


.1 


.6 


2 


.2 


.8 


.2 


.8 


.2 


.8 


.2 


3 


.8 


.2 


.8 


.2 


.8 


.2 


.8 


4 


.2 


.8 


.2 


.8 


.2 


.8 


.2 


5 


.8 


.2 


.8 


.2 


.8 


.2 


.8 


6 


.2 


.8 


.2 


.8 


.2 


.8 


.2 


7 


.8 


.2 


.8 


.2 


.8 


.2 


.8 


8 


.2 


.8 


.2 


.8 


.2 


.8 


.2 


9 


.8 


.2 


.8 


.2 


.8 


.2 


.8 


10 


.4 


.9 


.4 


.9 


.4 


.9 


.4 



b. MIN TEST VAR (B) , MAX ITEM VAR (B) , HOMOGENEOUS p (G) 
Known Results for the Population 
n 1 2 3 4 S $ 2 lOm 



1 
2 
3 
4 
5 
6 
7 
8 
9 

2SL 



1 
0 
1 
0 

1 

0 

1 

0 

1 



0 

1 

0 

1 

0 

1 

0 

1 

0 



1 
0 

1 

0 

1 

0 

1 

0 

1 



0 
1 
0 

1 

0 

1 

0 

1 

0 



1 
0 

1 

0 

1 

0 

1 

0 

1 

JL 



0 
1 
0 

1 

0 

1 

0 

1 

0 



1 
0 

1 

0 

1 

0 

1 

0 

1 

JL 



p .5 .5 .5 .5 .5 .5 .5 

var, .25 .25 .25 .25 .25 .25 .25 
ALPHA " 1.166667 TIMES (1 - (1.75 / .25)) - -7 



.25 



C. MIN TEST VAR (B) , MAX ITEM VAR (B) , HOMOGENEOUS p (G) 



n 
1 


1 


* 

0 


J 

1 


J — 
0 




1 


0 


1 


4 


2 


1 


1 


0 


1 


0 


1 


0 


4 


3 


1 


1 


1 


0 


1 


1 


1 


6 


4 


1 


1 


0 


1 


0 


1 


0 


4 


5 


1 


0 


1 


0 


0 


0 


1 


3 


6 


0 


1 


0 


1 


0 


1 


0 


3 


7 


1 


0 


1 


1 


1 


1 


1 


6 


8 


0 


0 


1 


0 


0 


1 


0 


2 


9 


1 


0 


1 


1 


1 


1 


1 


6 


10 


0 


1 


0 


1 


0 


1 


1 


4 


P 


.7 


.5 


.6 


.6 


.4 


.8 


.6 




varj^ . 


21 


.25 


.24 


.24 


.24 


.16 


24 


1.7< 


ALPHA 


» 1. 


166667 


TIMES 


(1 - 


(1.5b 


/ 1.76)) 


« 


.1193182 
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Tiblc 4 

Probability Tar^«t Matrix (a) , Population Modal Matrix (b) , and One 
of tha 10 Randomly 8aiq>lad Matr*ca« (c) for Modal 3 

a. MOD TEST VAR (N) , NOD ITEM VAR (M) , HETEROGENEOUS p (B) 
PROBABILZTY TARGET MATRIX 



1 


.1 .1 


.1 .1 








2 


.1 .1 


.1 .1 








3 


.1 .1 


.1 .1 








4 


.1 .1 


.1 .1 


.1 






5 


.1 .1 


.1 .1 


.1 






6 


.1 .1 


.1 .9 


.9 


.9 


.9 


7 


.1 .1 


.1 .1 


.9 


.9 


.9 


8 


.1 .1 


.1 .9 


.9 


.9 


.9 


9 


.1 .1 


.1 .1 


.9 


.9 


.9 


10 


.1 .1 


.1 .9 


.9 


.9 


.9 



b. MOD TEST VAR (M) , MOD ITEM VAR (M) , HETEROGENEOUS p (B) 
Known Results for the Population 



n 


1 


2 


3 


4 


5 


6 


7 


TOTAL 


1 


0 


0 


0 


0 


0 


0 


0 


0 


2 


0 


0 


0 


0 




0 


0 


0 


3 


0 


0 


0 


0 


0 


0 


0 


0 


4 


0 


0 


0 


0 


0 


0 


0 


0 


5 


0 


0 


0 


0 


0 


0 


0 


0 


6 


0 


0 


0 


1 








4 


7 


0 


0 


0 


0 








3 


8 


0 


0 


0 


1 








4 


9 


0 


0 


0 


0 








3 


10 


0 


0 


0 


1 








4 


p 


0 


0 


0 


.3 


.5 


.5 


.5 




var^^ 


0 


0 


0 


.21 


.25 


.25 


.25 


3.36 


ALPHA 




1.166667 


TIMES 


(1 - 


(.9600001 / 


3.36)) 


- .8333333 



C. MOD 


TEST 


VAR 


(M) , MOD ITEM VAR 


(M) , HETEROGENEOUS p 


n 


1 


2 


3 


4 


5 


6 


7 


TOTAL 


1 


0 


0 


0 


0 


1 


1 


0 


2 


2 


0 


0 


0 


0 


0 


0 


0 


0 


3 


0 


0 


0 


0 


0 


0 


0 


0 


4 


1 


0 


0 


0 


0 


0 


0 


1 


5 


0 


0 


0 


0 


0 


0 


0 


0 


C 


0 


0 


0 


1 


1 


0 


1 


3 


7 


0 


0 


1 


0 


1 


1 


1 


4 


8 


0 


0 


0 


1 


1 


1 


1 


4 


9 


0 


0 


0 


0 


0 


1 


0 


1 


19 


1 


0 


0 


1 


1 


1 


1 


5 


p 


.2 


0 




.3 


.5 


.5 


.4 


3.2 


vari . 16 


0 


.09 


.21 


.25 


.25 


.24 


ALPHA 


« 1.166667 


TIMES 


(1 - 


(1.2 


/ 3.2)) 


« .7291666 
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Tabl« 5 



Probability Target Matrix (a) , Population Model Matrix (b) , and 
of tha 10 Randoaly Saaplad Matriisaa (c) for Modal 4 



a. NIN TEST VAR (B) , MOO ITEM VAR 
PROBABILITY TARGET MATRIX 



(M), HETEROGENEOUS p (B) 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 



.1 
.2 
.2 
.2 
.2 
.2 
.2 
.2 
.2 
.3 



.1 
.2 
.2 
.2 
.2 
.2 
.2 
.2 
.2 
.3 



.1 
.2 
.2 
.2 
.2 
.2 
.2 
.2 
.2 
.3 



.2 
.2 
.2 
.2 
.2 
.8 
.8 
.8 
.8 
.8 



.7 
.8 
.8 
.8 
.8 
.2 
.2 
.2 
.2 
.3 



.2 
.2 
.2 
.2 
.2 
.8 
.8 
.8 
.8 
.8 



.7 
.8 
.8 
.8 
.8 
.2 
.2 
.2 
.2 
.3 



b. MIN TEST VAR (B) , MOD ITEM VAR (M) , HETEROGENEOUS p (B) 
Known Results for the Population 



n 


1 


2 


3 


4 


? 


6 




TOTAL 


I 


0 


0 


0 


0 


1 


0 




2 


2 


0 


0 


0 


0 


1 


0 




2 


3 


0 


0 


0 


0 


1 


0 




2 


4 


0 


0 


0 


0 


1 


0 




2 


5 


0 


0 


0 


0 


1 


0 




2 


6 


0 


0 


0 


1 


0 




0 


2 


7 


0 


0 


0 


1 


0 




0 


2 


8 


0 


0 


0 


1 


0 




0 


2 


9 


0 


0 


0 


1 


0 




0 


2 


10 


0 


0 


0 


1 






0 


2 


P 

vari 
ALPHA 


0 
0 

- 1. 


0 
0 

166667 


0 
0 

TIMES 


.5 
.25 
(1 - 


.5 
.25 
(1 / 


.5 
.25 
0)) - 


.5 
.25 
-999 


0 



C. MIN TEST VAR (B) , MO J ITEM VAR (M) , HETEROGENEOUS p (B) 
_I1 1 2 3 4 5 fi 2 lS2m 



1 
2 
3 
4 
5 
6 
7 
8 
9 
liL 



0 
0 
0 
0 
0 
0 
0 

1 

0 



0 
0 
0 
0 
0 
0 
0 
0 
0 

JL 



0 
0 
0 
0 
0 

1 

0 
0 
0 

A. 



1 

0 
0 
0 

1 
1 

0 

1 

0 



1 
1 
1 
1 

0 
0 
0 
0 
0 



0 
0 
0 



p .2 0 .2 .5 .4 .7 .4 

vari .16 0 .16 .25 .24 .21 .24 
ALPHA - 1.166667 TIMES (1 - (1.26 / .84)) = -.5833333 



1 
1 
1 
0 
1 
0 
0 
0 
0 



3 
2 
2 
2 
3 
3 
1 
3 
1 
4 



.84 
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Tabltt 6 

Results froa ths Fo\ir Models Using the Monte Carlo Method 



VARIABLE 



■mean std dev minimum maximum 



ALPHA 
TESTVAR 
IVARSUN 
ITEMPSD 

HfidftI-2 

ALPHA 

TESTVAR 

IVARSUM 

ITEMPSD 

ALPHA 
TESTVAR 
IVARSUM 
ITEMPSD 

HfidfiL-i 

ALPHA 

TESTVAR 

IVARSUM 

ITEMPSD 



(MAXIMUM, 

.940 
8.884 
1.702 

.074 
(MINIMUM, 
-.146 
1.564 
1.606 

.134 
(MODERATE 

.656 
2.876 
1.210 

.187 
(MINIMUM, 
-.232 
1.295 
1.353 

.186 



MAXIMUM, HOMOGENEOUS) 

.026 .906 .984 10 

1.134 7.610 11.050 10 

.028 1.660 1.740 10 

.021 .035 .105 10 

MAXIMUM, HOMOGENEOUS) 

.377 -.718 .537 10 

.585 1.040 3.040 10 

.053 1.520 1.680 10 

.026 .099 .175 10 

, MODERATE, HETEROGENEOUS) 

.138 .292 .778 10 

.475 1.600 3.200 10 

.083 1.030 1.32C 10 

.024 .139 .?^5 10 

MODERATE, HETEROGENEOUS) 

.509 -.833 .539 10 

.536 .760 2.250 10 

.178 1.100 1.580 10 

.043 .120 .251 10 



Table 7 

Rank of Influence of Factors Affecting Coefficient Alpha 



" X 



IVARSUM 
ITEMPSD 



« X3 « TESTVAR 



Coefficient Used 


First 


Second 


Third 


fl-weight 




-x^ 




Structured 


X3 




X2 
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Plgur* 1 

Histograms illustrating total tast variance with a 7-ite», 
dichotoBOUsly scorad test with 10 exaainees 



X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

X X 

_2L -2L 



01234567 01234567 

Total Test Score Total Test Score 

For each item, p - 1 ^ox each item, p 

- 0 ffT^ - 0 



X X 

X X 

X X 

XX X X 

XXXXXXXX -X ____x_ 

01234567 01234567 
Total Test Score Total Test Score 

a^2 . 3.21 - 12.25 
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Figur* 2 

Thtt 11 Possibltt Varianctts for a OichotOBOusly-Scorad Itaa 

CoBplatad by 10 Exaainaas 



I 

i 
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Flgur* 3 ^ , w 

Linwr regression line for predicted alpha versus observed alpha 




PREDICTED ALPHA 
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